A Review Corpus for Argumentation Analysis
Abstract
The analysis of user reviews has become critical in research and industry, as user reviews increasingly impact the reputation of products and services. Many review texts comprise an involved argumentation with facts and opinions on different product features or aspects. Therefore, classifying sentiment polarity does not suffice to capture a review’s impact. We argue that an argumentation analysis is needed, covering tasks such as opinion summarization and sentiment score prediction. Since language resources to drive such research are missing, we have designed the ArguAna TripAdvisor corpus, which compiles 2,100 manually annotated hotel reviews balanced with respect to the reviews’ sentiment scores. Each review text is segmented into facts, positive opinions, and negative opinions, while all hotel aspects and amenities are marked. In this paper, we present the design and a first study of the corpus. We reveal patterns of local sentiment that correlate with sentiment scores, thereby defining a promising starting point for an effective argumentation analysis.
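To make the annotation scheme described in the abstract more concrete, the following is a minimal Python sketch of how one segmented review might be represented and how a simple "local sentiment" pattern could be derived from it. The class names, the label strings, the 1 to 5 score range, the two helper functions, and the example review text are all assumptions made for illustration; they are not the corpus's actual file format or the authors' analysis method.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical label set mirroring the three segment types named in the abstract.
FACT, POSITIVE, NEGATIVE = "fact", "positive", "negative"


@dataclass
class Segment:
    text: str
    label: str  # one of FACT, POSITIVE, NEGATIVE


@dataclass
class Review:
    score: int  # overall sentiment score; a 1-5 scale is assumed here
    segments: List[Segment]


def sentiment_flow(review: Review) -> List[str]:
    """Return the sequence of segment labels, merging adjacent repeats."""
    flow: List[str] = []
    for seg in review.segments:
        if not flow or flow[-1] != seg.label:
            flow.append(seg.label)
    return flow


def positive_share(review: Review) -> float:
    """Fraction of opinion segments (facts excluded) that are positive."""
    opinions = [s for s in review.segments if s.label != FACT]
    if not opinions:
        return 0.0
    return sum(s.label == POSITIVE for s in opinions) / len(opinions)


# Made-up review text, used only to exercise the helpers above.
example = Review(score=4, segments=[
    Segment("We stayed three nights.", FACT),
    Segment("The room was spotless.", POSITIVE),
    Segment("Breakfast was great too.", POSITIVE),
    Segment("The Wi-Fi kept dropping.", NEGATIVE),
])

print(sentiment_flow(example))
print(positive_share(example))
```

Features of this kind (the order of local sentiments, or the share of positive opinions) are one plausible way to relate segment-level annotations to a review's overall score; the paper itself should be consulted for the patterns the authors actually report.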
Similar resources
Argumentation for Scientific Claims in a Biomedical Research Article
This paper provides an analysis of some argumentation in a biomedical genetics research article as a step towards developing a corpus of articles annotated to support research on argumentation. We present a specification of several argumentation schemes and inter-argument relationships to be annotated.
From Discourse Analysis to Argumentation Schemes and Back: Relations and Differences
In argumentation theory, argumentation schemes are abstract argument forms expressed in natural language, commonly used in everyday conversational argumentation. In computational linguistics, discourse analysis has been conducted to identify the discourse structure of connected text, i.e. the nature of the discourse relationships between sentences. In this paper, we propose to couple these two...
Identifying Argumentation Schemes in Genetics Research Articles
This paper presents preliminary work on identification of argumentation schemes, i.e., identifying premises, conclusion and name of argumentation scheme, in arguments for scientific claims in genetics research articles. The goal is to develop annotation guidelines for creating corpora for argumentation mining research. This paper gives the specification of ten semantically distinct argumentatio...
Towards Creation of a Corpus for Argumentation Mining the Biomedical Genetics Research Literature
Argumentation mining involves automatically identifying the premises, conclusion, and type of each argument as well as relationships between pairs of arguments in a document. We describe our plan to create a corpus from the biomedical genetics research literature, annotated to support argumentation mining research. We discuss the argumentation elements to be annotated, theoretical challenges, a...
Argumentative meanings and their stylistic configurations in clinical research publications
The paper reports on the results of an exploratory study into the topical organisation and stylistic features of argumentation in a corpus of ophthalmic clinical research papers. The study responds to the need for systematised and generalisable argumentation models in knowledge-intensive fields. We present here a schematised superstructure of the arguments from the corpus, charting the configura...